TIME: 13:34:27



INPUT SET: S33766.raw 



This Raw Listing contains the General
Information Section and up to the first 5 pages.
SEQUENCE LISTING

(1) GENERAL INFORMATION:
(i) APPLICANT: Das, Goutam

(ii) TITLE OF INVENTION: DNA Molecules for Expression of
Polypeptides



(iii) NUMBER OF SEQUENCES: 4 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: White & Case 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 10036-2787 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/624,398

(B) FILING DATE: 04-APR-1996

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: PCT/SE96/00318

(B) FILING DATE: 12-MAR-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: SE 9501939-4 

(B) FILING DATE: 24-MAY-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Thelma A. Chen Cleland 

(B) REGISTRATION NUMBER: ^0,948 

(C) REFERENCE/DOCKET NUMBER: 1103326-0206 
RAW SEQUENCE LISTING DATE: 10/26/1999

PATENT APPLICATION US/09/418,176 TIME: 13:34:28

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 819-8200 

(B) TELEFAX: (212) 354-8113



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: mammary gland 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 82..2319

(D) OTHER INFORMATION: /product= "bile-salt-stimulated lipase"

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 985..1173

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1174..1377

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1378..1575

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1576..2415

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 151..2316

(ix) FEATURE:

(A) NAME/KEY: polyA_signal

(B) LOCATION: 2397..2402
(ix) FEATURE:

(A) NAME/KEY: repeat_region

(B) LOCATION: 1756..2283

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..81

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1756..1788

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1789..1821

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1822..1854

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1855..1887

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1888..1920

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1921..1953

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1954..1986

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 1987..2019

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2020..2052

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2053..2085

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2086..2118




(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2119..2151

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2152..2184

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2185..2217

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2218..2250

(ix) FEATURE:

(A) NAME/KEY: repeat_unit

(B) LOCATION: 2251..2283
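
The feature coordinates for SEQ ID NO:1 can be cross-checked arithmetically. The following is a minimal sketch, not part of the filed listing: it assumes the locations are 1-based and inclusive, and the names `span_length` and `tile` are hypothetical helpers introduced only for illustration.

```python
# Feature locations copied from the listing (assumed 1-based, inclusive).
CDS = (82, 2319)
MAT_PEPTIDE = (151, 2316)
REPEAT_REGION = (1756, 2283)
REPEAT_UNIT_LENGTH = 33  # length of the first repeat_unit, 1756..1788


def span_length(start, end):
    """Number of bases in an inclusive start..end location."""
    return end - start + 1


def tile(start, end, unit):
    """Split an inclusive region into consecutive units of fixed length."""
    return [(s, s + unit - 1) for s in range(start, end + 1, unit)]


units = tile(*REPEAT_REGION, REPEAT_UNIT_LENGTH)

# The CDS should contain a whole number of codons.
cds_codons, remainder = divmod(span_length(*CDS), 3)

# Codons between the CDS start and the mature peptide start
# (the leader residues numbered with negative positions).
leader_codons = (MAT_PEPTIDE[0] - CDS[0]) // 3

print(len(units), units[0], units[-1])
print(cds_codons, remainder, leader_codons)
```

Under these assumptions the repeat region divides into 16 units of 33 bases, which agrees with the sixteen repeat_unit entries, and the 23 leader codons agree with the residue numbering that starts at -23.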



(x) PUBLICATION INFORMATION:

(A) AUTHORS: Nilsson, Jeanette
Blackberg, Lars
Carlsson, Peter
Enerback, Sven
Hernell, Olle
Bjursell, Gunnar

(B) TITLE: cDNA cloning of human-milk
bile-salt-stimulated lipase and evidence for its
identity to pancreatic carboxylic ester hydrolase

(C) JOURNAL: Eur. J. Biochem.

(D) VOLUME: 192

(F) PAGES: 543-550

(G) DATE: Sept-1990

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ACCTTCTGTA TCAGTTAAGT GTCAAGATGG AAGGAACAGC AGTCTCAAGA TAATGCAAAG   60

AGTTTATTCA TCCAGAGGCT G ATG CTC ACC ATG GGG CGC CTG CAA CTG GTT    111
                        Met Leu Thr Met Gly Arg Leu Gln Leu Val
                        -23         -20

GTG TTG GGC CTC ACC TGC TGC TGG GCA GTG GCG AGT GCC GCG AAG CTG    159
Val Leu Gly Leu Thr Cys Cys Trp Ala Val Ala Ser Ala Ala Lys Leu
            -10                                     1

GGC GCC GTG TAC ACA GAA GGT GGG TTC GTG GAA GGC GTC AAT AAG AAG    207
Gly Ala Val Tyr Thr Glu Gly Gly Phe Val Glu Gly Val Asn Lys Lys
    5                   10

CTC GGC CTC CTG GGT GAC TCT GTG GAC ATC TTC AAG GGC ATC CCC TTC    255
Leu Gly Leu Leu Gly Asp Ser Val Asp Ile Phe Lys Gly Ile Pro Phe
20                  25                  30                  35

GCA GCT CCC ACC AAG GCC CTG GAA AAT CCT CAG CCA CAT CCT GGC TGG    303
Ala Ala Pro Thr Lys Ala Leu Glu Asn Pro Gln Pro His Pro Gly Trp
                40                  45                  50

CAA GGG ACC CTG AAG GCC AAG AAC TTC AAG AAG AGA TGC CTG CAG GCC    351
Gln Gly Thr Leu Lys Ala Lys Asn Phe Lys Lys Arg Cys Leu Gln Ala
            55                  60                  65

ACC ATC ACC CAG GAC AGC ACC TAC GGG GAT GAA GAC TGC CTG TAC CTC    399
Thr Ile Thr Gln Asp Ser Thr Tyr Gly Asp Glu Asp Cys Leu Tyr Leu
        70                  75                  80

AAC ATT TGG GTG CCC CAG GGC AGG AAG CAA GTC TCC CGG GAC CTG CCC    447
Asn Ile Trp Val Pro Gln Gly Arg Lys Gln Val Ser Arg Asp Leu Pro
    85                  90                  95

GTT ATG ATC TGG ATC TAT GGA GGC GCC TTC CTC ATG GGG TCC GGC CAT    495
Val Met Ile Trp Ile Tyr Gly Gly Ala Phe Leu Met Gly Ser Gly His
100                 105                 110                 115

GGG GCC AAC TTC CTC AAC AAC TAC CTG TAT GAC GGC GAG GAG ATC GCC    543
Gly Ala Asn Phe Leu Asn Asn Tyr Leu Tyr Asp Gly Glu Glu Ile Ala
                120                 125                 130

ACA CGC GGA AAC GTC ATC GTG GTC ACC TTC AAC TAC CGT GTC GGC CCC    591
Thr Arg Gly Asn Val Ile Val Val Thr Phe Asn Tyr Arg Val Gly Pro
            135                 140                 145

CTT GGG TTC CTC AGC ACT GGG GAC GCC AAT CTG CCA GGT AAC TAT GGC    639
Leu Gly Phe Leu Ser Thr Gly Asp Ala Asn Leu Pro Gly Asn Tyr Gly
        150                 155                 160

CTT CGG GAT CAG CAC ATG GCC ATT GCT TGG GTG AAG AGG AAT ATC GCG    687
Leu Arg Asp Gln His Met Ala Ile Ala Trp Val Lys Arg Asn Ile Ala
    165                 170                 175

GCC TTC GGG GGG GAC CCC AAC AAC ATC ACG CTC TTC GGG GAG TCT GCT    735
Ala Phe Gly Gly Asp Pro Asn Asn Ile Thr Leu Phe Gly Glu Ser Ala
180                 185                 190                 195

GGA GGT GCC AGC GTC TCT CTG CAG ACC CTC TCC CCC TAC AAC AAG GGC    783
Gly Gly Ala Ser Val Ser Leu Gln Thr Leu Ser Pro Tyr Asn Lys Gly
                200                 205                 210

CTC ATC CGG CGA GCC ATC AGC CAG AGC GGC GTG GCC CTG AGT CCC TGG    831
Leu Ile Arg Arg Ala Ile Ser Gln Ser Gly Val Ala Leu Ser Pro Trp
            215                 220                 225

GTC ATC CAG AAA AAC CCA CTC TTC TGG GCC AAA AAG GTG GCT GAG AAG    879
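
The pairing of codons with residues in the sequence rows can be spot-checked by translating the nucleotide line and comparing it with the amino acid line beneath it. The following is a minimal sketch, not part of the filed listing: it assumes the standard genetic code, uses one-letter residue codes where the listing prints three-letter codes, and `translate` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; spaces are ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First ten codons of the CDS (positions 82..111 of SEQ ID NO:1).
leader_start = "ATG CTC ACC ATG GGG CGC CTG CAA CTG GTT"
print(translate(leader_start))  # prints MLTMGRLQLV
```

The result corresponds to Met Leu Thr Met Gly Arg Leu Gln Leu Val, the residues printed under that row. A codon that translates to `*` (stop) inside the reading frame would point to a transcription error in the nucleotide line.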



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 10/26/1999

PATENT APPLICATION US/09/418,176 TIME: 13:34:29

INPUT SET: S33766.raw

Line    Error                                               Original Text

544     Stop Codon at end of sequence removed - no error    (2) INFORMATION FOR SEQ ID NO:3:



